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Abstract 

To complement traditional influenza surveillance with data on disease occurrence not only among care-seeking individuals, 
the Swedish Institute for Communicable Disease Control (SMI) has tested an Internet-based monitoring system (IMS) with 
self-recruited volunteers submitting weekly on-line reports about their health in the preceding week, upon weekly 
reminders. We evaluated IMS acceptability and to which extent participants represented the Swedish population. We also 
studied the agreement of data on influenza-like illness (ILI) occurrence from IMS with data from a previously evaluated 
population-based system (PBS) with an actively recruited random sample of the population who spontaneously report 
disease onsets in real-time via telephone/Internet, and with traditional general practitioner based sentinel and virological 
influenza surveillance, in the 2011-2012 and 2012-2013 influenza seasons. We assessed acceptability by calculating the 
participation proportion in an invited IMS-sample and the weekly reporting proportion of enrolled self-recruited IMS 
participants. We compared distributions of socio-demographic indicators of self-recruited IMS participants to the general 
Swedish population using chi-square tests. Finally, we assessed the agreement of weekly incidence proportions (%) of ILI in 
IMS and PBS with cross-correlation analyses. Among 2,511 invited persons, 166 (6.6%) agreed to participate in the IMS. In 
each season, 2,552 and 2,486 self-recruited persons participated in the IMS respectively. The weekly reporting proportion 
among self-recruited participants decreased from 87% to 23% (2011-2012) and 82% to 45% (2012-2013). Women, highly 
educated, and middle-aged persons were overrepresented among self-recruited IMS participants (p<0.01). IMS (invited and 
self-recruited) and PBS weekly incidence proportions correlated strongest when no lags were applied (r = 0.71 and r = 0.69, 
p<0.05). This evaluation revealed socio-demographic misrepresentation and limited compliance among the self-recruited 
IMS participants. Yet, IMS offered a reasonable representation of the temporal ILI pattern in the community overall during 
the 2011-2012 and 2012-2013 influenza seasons and could be a simple tool for collecting community-based ILI data. 
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Introduction 

The societal consequences of influenza - whether it is seasonal 
or pandemic - include loss of production, exhaustion of health 
care resources, and excess mortality [1]. When monitoring the 
recurring epidemics valid incidence data in close-to-real-time can 
be of value. Traditionally, influenza surveillance is based on 
information from health care-based sources, including clinical and 
virological data [2]. However, influenza is often mild, does not 
always require health care and the proportion of iU people who see 
their general practitioner (GP) may be context dependent [3]. 
Therefore, traditional influenza surveillance may not generate 
valid representations of an epidemic in the community. To 
supplement traditional influenza surveillance systems with com- 
munity-based information, the Swedish Institute for Communica- 
ble Disease Control (SMI) has tested two different prospective 



ways of collecting data on influenza-like illness (ILI) from the 
general population. Since 2007, a population-based system (PBS) 
uses cohorts established through annual random sampling of the 
target population. The cohort members provide event-driven, self- 
initiated reports via automated technologies as soon as a 
respiratory tract infection occurs. The PBS has been evaluated 
previously [4,5]. Since 20 11, an Internet-based monitoring system 
(IMS) with self-recruited participants provides reports of partici- 
pants' recent health status, which is collected upon a weekly 
reminder automatically dispatched via e-mail. The latter method 
was developed in the Netherlands in 2003 [6,7]. SMI has 
compared representativeness and the obtained surveillance data 
in IMS and PBS during two seasons within the EPIWORK 
project, a European commission seventh frame work programme 
consortium aiming to build the foundation for an infrastructure to 
generate epidemic forecasts [8]. We regarded PBS as the best 
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available and most rigorously evaluated method for community- 
based surveillance and therefore used it as the reference standard 
to IMS. 

The self-selection involved in the recruitment of IMS partici- 
pants has raised concerns about the vahdity of obtained ILI 
incidence data and its ability to reflect epidemics in the 
community. Reports from IMS in other European countries 
suggest that the self-recruited sample misrepresents its target 
population, but that ILI patterns correlate well with health care- 
based ILI data [6,7,9-11]. However, comparison of IMS with 
corresponding data generated from the more representative PBS 
cohort may provide better insights about the validity of disease 
occurrence data and the need, if any, for calibration of ILI 
estimates to correct for potential systematic errors. In this study, 
we first assessed the acceptability of IMS among randomly selected 
individuals who were invited to participate. Second, we assess(;d 
the representativeness of self-recruited IMS participants and 
compared it to that of the invited participants. Third, we 
compared IMS and PBS data in terms of ILI occurrence across 
the 2011-2012 and 2012-2013 influenza seasons (henceforth 
referred to as season 1 and season 2, respectively). We also related 
data from both systems to concurrently collected data from regular 
influenza surveillance: the laboratory reports from routine 
diagnostics and the GP based sentinel surveillance [12]. 

Methods 

Ethics Statement 

The IMS and PBS were reviewed and approved by the 
Stockholm regional research ethics review board (IMS: 201 1/387- 

31/4, 2012/144,5-32/4 and PBS: 2007/952-31, 2007/1599-32, 
2008/1227-32, 2009/752-31, 2010/237-31/5, 2012/1444-32/5). 

The Internet-based Monitoring System (IMS) 

IMS is a Swedish adaptation of the European-wide Influenzanet 
[13,14]. During season 1, we recruited participants using press 
releases and resulting media attention, and by interpersonal 
communication through social media channels (henceforth 
referred to as self-recruited participants). Interested presumptive 
participants were directed to the project website. In season 2, we 
re-contacted participants from the previous season via e-mail with 
an invitation to participate again and recruited new participants as 
described above. 

In season 2 we also investigated the possibility of implementing 
a population-based variant of IMS. We drew a random sample of 
2,511 persons of the Swedish population aged 3 months through 
95 years from the population register and invited them by post to 
participate in IMS (henceforth referred to as invited participants). 
The distribution of socio-demographic indicators of the random 
sample are available in Table SI. 

Both self-recruited and invited participants (henceforth referred 
to as all IMS participants) initiated their participation by providing 
an e-mail address in a password-protected user account and by 
completing a background questionnaire on the project website. A 
user account could include one or several participants, enabling 
parents to report on behalf of children <16 years. Participants 
received weekly e-mails that prompted them to visit the website 
and record occurrence of 18 listed symptoms or absence of 
symptoms in the preceding week. Upon affirmation of one or more 
symptoms, the participant was presented with follow-on questions 
(e.g. date of symptom onset). The components of the IMS across 
seasons are presented in Table 1. 



The Population-based System (PBS) 

Population-based surveillance uses a sample from the general 
population, defined by geopolitical boundaries which constitutes 
the denominator, and/or the sampling frame [15]. In the PBS we 
collected data directly from individuals that had been recruited for 
the surveillance, based on a random sample of the general 
population. Descriptions and evaluations of the PBS have been 
presented previously [4,5,16]. Briefly, (-ach year we invited 
representative samples of the general population, 3 months 
through 95 years of age to participate in PBS. Participants were 
instructed to spontaneously report all new episodes of colds and 
fevers within seven days of symptom onset from September/ 
October through May via a secure website or a telephone based 
interactive voice response system. When reporting, participants 
answered an automated, tree-structured symptom questionnaire. 
Due to non-participation, there is moderate ()V('r-rc'pr(;sc'ntation of 
elderly people, women, well-educated individuals, people with a 
high household income, married people, and people living in two- 
person households [4] . Evaluations have shown that the telephone 
service seems to be particularly attractive for elderly and low- 
educated people, but the reporting technology per se does not 
appear to affect the reporting [4]. A validation study revealed 
substantial under-reporting, which was remarkably constant over 
time and across seasons, thus allowing simple adjustments [5]. 

After confinement to Stockholm County for five years, PBS was 
extended to all of Sweden in the 2012-2013 season. In season 1, 
2,580 out of 14,022 persons sampled from Stockholm's population 
participated in PBS. In season 2, 2,236 out of 14,558 people 
sampled from all over Sweden participated. The components of 
the PBS across seasons are presented in Table 1. The age and sex 
distributions of PBS participants during season 1 and 2 are 
available in Table S2. 

Evaluation Analysis 

All analyses were based on data collected between November 
and May in season 1 and season 2, respectively. Each season lasted 
27 weeks and was analysed independentiy. 

All IMS participants who created an account in the system but 
never submitted a weekly report, participants with missing 
information on postcode, and participants with a birth date in 
the future, were excluded from analysis. Further, incomplete 
reports, reports where symptom onset preceded participation, and 
reports with a future date of symptom onset were excluded. 

Definitions. Self-selection into IMS may lead to preferential 
inclusion of people with symptoms and preferential re-entry of 
people with symptoms after temporary periods of non-reporting. 
Therefore, we only included reports preceded by at least one 
report in the previous three weeks (henceforth referred to as active 
reports) for our incidence calculations. To explore how the 
definition of active participation affected disease patterns, we 
defined a strictly active report as a report preceded by two 
consecutive reports in the previous two weeks in a supplementary 
analysis (henceforth referred to as strictly active). 

We defined a report of illness in IMS or PBS as ILI if it included 
sudden onset of symptoms AND at least one of the following 
systemic symptoms: fever or feverishness, headache, or myalgia, 
AND at least one of the following respiratory symptoms: cough, 
sore throat, shortness of brc'ath, or coryza. Coryza was omitted 
from the case definition in season 2. We calculated the weekly 
incidence proportions (%) among all IMS participants by dividing 
the number of active ILI reports by the total number of active 
reports in that week. We calculated the weekly incidence 
proportions among PBS participants by dividing the number of 
ILI reports by the total number of cohort members in that week. 
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Table 1 . Summary of system components of the IMS and PBS during the influenza seasons 201 1 -201 2 and 201 2-201 3. 





Component 


ScdaUII 1 1 1 


-201 2) 


Season 2 (2012-2013) 




IMS 


PBS 


IMS 


PBS 


Invited sample 




14,022 


2,511 


14,558 


Invited participants (response proportion %) 




2,580 (18) 


166 (7) 


2,236 (15) 


Self-recruited participants 


2,552 




2,486 




Geographical area 


Sweden 


Stockholm 


Sweden 


Sweden 


Calendar weeks operating 


46-20 


38-20 


47-21 


44-20 



doi:l 0.1 371/journal.pone.0096740.t001 



We corrected PBS ILI rates for previously estimated misrepre- 
sentation of demographic substrata [4] and under-reporting [5] . 

To evaluate the performance of IMS for surveillance in close- to- 
real-time, episodes that started >7 days before the reporting date 
were excluded from the analysis. Symptoms fitting the case 
definition in two consecutive weekly reports were considered to 
represent the same episode of illness and only the first report was 
included in the analysis. 

Acceptability by participants. To measure the acceptability 
[1 7] in terms of willingness of persons to participate in IMS, we 
calculated the proportion of participation among the invited 
participants. We also calculated the weekly response proportion 
among all enrolled IMS participants by dividing the number of 
reports during the week in question by the accumulated number of 
participants enrolled up to that week (for all reports and for active 
reports only, separately for each season). In season 2, we also 
stratified by self-recruitment or recruitment by invitation. The 
reports and active reports were summarized by the total, mean and 
median number per participant across each season. Additionally, 
we calculated the median proportion of complete reports per 
individually acquired participation time (time from registration 
week until the season's last week). 

Representativeness of the participants. To assess the 
representativeness [17] of self-recruited and invited IMS partic- 
ipants, we compared participants and the general Swedish 
population in terms of distributions of age, sex, level of education, 
and county of residence, using chi-square tests. We analysed 
invited and self-recruited IMS participants separately. For the 
invited sample in season 2, we collapsed the 21 counties of 
residence into three regions [Gbtaland, Southern Sweden; Svealand, 
Central Sweden; and Morrland, Northern Sweden) due to small 
numbers in many counties. We performed a supplementary 
analysis of representativeness including only participants who had 
contributed at least one active report. We considered p-values < 
0.05 as significant. 

Time series of ILI data. W e compared ILI occurrence data 
from IMS (based on reports from all IMS participants) to 
incidence data from PBS (corrected for estimated demographic 
misrepresentation [4] and underreporting [5]). To examine the 
possible presence of systematic differences between the two 
methods that could be amenable to simple calibration, we 
compared incidence proportions week by week and across seasons 
with particular reference to periods with known increased 
influenza activity. We applied Bland- Altman plots [18] and 
method comparison techniques [19] to determine if observations 
from both methods directly agreed, and if not, if they agreed after 
mathematical transformation of the data. We also studied cross- 
correlations of the incidence proportions [20]. Further, we studied 
the cross-correlation of IMS and PBS incidence proportions with 



ILI data generated by the GP-based sentinel surveillance system 
(weekly number of ILI cases per 1000 000 listed patients) and 
laboratory reports (number of laboratory-confirmed influenza 
cases per week). Before analysis, we smoothed the weekly 
incidence proportions using a two-week moving average. We 
plotted each time series and calculated Spearman correlation 
coefficients (r) on ranked data for different lags (+/ — 5 weeks) 
between: IMS and PBS; IMS and laboratory data; PBS and 
laboratory data; IMS and GP-based sentinel data; and PBS and 
GP-based sentinel data. 

Since PBS was applied only in Stockholm County during season 
1, we also restricted IMS data to Stockholm County only. 
However, due to small numbers in the GP-based sentinel data 
from Stockholm, these were not included in the season 1 analysis. 
For season 2, we included all four surveillance systems and made 
all comparisons at the national level. 

In order to examine if the time series comparison would 
improve after attempts to correct the incidence proportion 
according to the general Swedish population, we performed a 
supplementary analysis based on weighted IMS data. We weighted 
the IMS sample by assigning each participant a weight 
calculated with the formula [2 1] Wpartidpant = Pswedish population/ 
PiMS participants (whcrc Wp„,;,-;p^„t = wclght of cach IMS partici- 
pant, Pswedish population = proportion of the general population 
of Sweden in the same age and sex group as the participant and 
PiMS partidpants = proportiou of thc IMS sample in the same age 
and sex group as the participant). 

Results 

Acceptability 

During, season 1 and season 2, respectively 2,552 and 2,486 
self-recruited IMS participants submitted at least one report. Of 
2,51 1 randomly selected residents who were invited to IMS, 166 
(6.6%) signed up to participate and submitted at least one report. 

In season 1, as the number of participants increased, the 
number of reports per week increased gradually until week 9 of 
2012, when it started decreasing (Table 2). The weekly proportion 
of participants reporting was highest (87%) in the first week but fell 
almost monotonicaUy to its lowest value (23%) in the last week of 
the season. The reporting proportion counting only active reports 
increased to 50% after the first three weeks with comparable levels 
in the following three weeks, but then it gradually fell to its lowest 
point (21%) in the last week. The median number of total reports 
and active reports per participant was 4 (range 1-27) and 3 (range 
0-26). During calendar weeks 8, 9, and 10, coinciding with the 
season's influenza peak, the cumulative number of reporting 
participants increased, and the active-to-total report ratio was the 
greatest. Many participants also joined the system in these weeks. 
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Based on the individually acquired participation time, the median 
completion proportion of all possible reports and active reports 
were 27% (range: 4-100) and 17% (range: 0-96) respectively. 

In season 2, the number of reports per week also increased 
gradually in the beginning, when the influx of participants was 
grcat(\st (Table 2). In contrast to season 1, however, the weekly 
number of reports remained constant throughout the season, with 
the exception of a dip in calendar week 12 due to a technical 
malfunction of the website. Notwithstanding this stability, the 
weekly reporting proportion among self-recruited IMS partici- 
pants fell slowly across the season, from its highest (82"/()) in the 
first two weeks to 45 % in the last week. The reporting proportion 
counting only active reports peaked (68%) in weeks 50 and 3 and 
was 43% in the last week. The median number of total reports (13, 
range 1-26) and of active reports (11, range 0-25) was higher than 
in season 1. Based on the individual participation duration, the 
median completion proportion of all possible reports and active 
reports among self-recruited IMS participants were 64% (range: 
4—100) and 57% (range: 0-96) respectively. 

In season 2, the median number of reports and active reports per 
registered participant were, respectively, 62% (21 vs. 13, p<0.01) 
and 82% (20 vs. 11, p<0.01) higher among invited IMS 
participants than among the self-recruited ones. Disrc-garding 
weeks 1 2 and 1 3 (affei:ted by the malfunctioning website in week 
1 2), the lowest proportion of participants reporting counting only 
active reports among the invited participants (67% in week 20) was 
of the same magnitude as the highest proportion among the self- 
recruited (68% in week 3). Based on the individual participation 
duration, the median completion proportion of all possible reports 
and active reports among invited IMS participants were 84% 
(range: 4—100) and 79% (range 0-96) respectively. 

Representativeness 

For both seasons and irrespective of how participation was 
defined, self-recruited IMS participants were more likely to be 
female, university educated and aged 40-64 than the general 
population (p<0.01 for each comparison, Table 3). The 
geographical distribution of participants differed from the Swedish 
population (p<0.01). For instance, 29% (season 1) and 34% 
(season 2) of the self-recruited participants resided in Stockholm 
County compared with only 22% of the Swedish population. In 
both seasons, only 1 1 % of participants resided in the Swedish 
county containing the second largest city Gothenburg; this county 
accommodates 17% of the Swedish population. 

The age and sex distributions among invited IMS participants 
differed less from the Swedish population than did the corre- 
sponding distributions among the- self-recruited (Table 3). The 
under-representation of the 0-17 and £65 year age groups, 
though statistically significant, was less marked (p = 0.01). With 
reservation for the small numbers, the geographical distribution of 
invited participants was similar to that of the Swedish population 
(p = 0.75). 

Comparison of Time Series 

Season 1 (Stockholm County 2011-2012). Smootiied 
weekly ILI incidence proportions ranged between 0.6-4.4% 
(IMS) and 0.8-2.8% (PBS). IMS reached its peak in week 7, 
whereas PBS and laboratory reports of influenza diagnoses 
reached their peaks in week 9 (Figure 1). 

IMS correlated with PBS (p<0.05) with the largest coefficient 
(r = 0.71) when no lag was applied. The correlation was stiU 
significant with a shift of PBS data back in time (lead) by up to two 
weeks (r - 0.56) and with shift of PBS data forward in time (lag) by 
one week (r = 0.59). IMS correlated with laboratory data (p<0.05) 



with the highest correlation without a lag (r = 0.77). The 
correlation was till significant with a shift of laboratory data by 
two weeks back in time (r = 0.40) and by one week forward in time 

(r = 0.65). PBS also correlated with laboratory data (p<0.05) with 
the highest correlation without a lag (r = 0.63). However, 
correlations were also significant wit\\ a shift of laboratory data 
by one week back in time (r — 0.49) and two weeks forward in time 
(r = 0.46). 

Season 2 (Sweden 2012-2013). The corresponding curves 
for s(;ason 2, pertaining to all of Sweden, also include the weekly 
number of ILI cases among listed patients reported in the GP- 
based sentinel system (Figure 1). The smoothed weekly incidence 
proportions of reported ILI ranged between 1.1-2.8% in IMS and 

0. 7,3% in PBS. 

IMS correlated with PBS (p<0.05) with the maximum 
coefficient (r = 0.69) when no lag was applied, but the correlation 
remained significant with a shift of PBS data back in time by up to 
two weeks (r = ().54) and forward in time by up to two weeks 
(r = 0.47). IMS correlated with laboratory data (p<0.05) with the 
maximum coefficient without a lag (r = 0.56) and with GP sentinel 
data with a shift of GP sentinel data one week back in time 
(r = 0.61). Correlations were also significant between two weeks 
lead (laboratory: r = 0.44, sentinel: r = 0.54) and two weeks lag 
(laboratory: r = 0.48, sentinel: r = 0.51) of laboratory and sentinel 
data, respectively. PBS correlated with laboratory and sentinel 
data (p<0.05) with the maximum coefficient at a four week lag for 
laboratory data (r = 0.50) and sentinel data (r = 0.55), respectively. 
Correlations were also significant between one (r = 0.42) and five 
weeks lag (r = 0.49) of laboratory data and between zero (r = 0.47) 
and five (r = 0.50) weeks lag of sentinel data. 

Exploring a stricter definition of active 
participation. When applying stricter criteria to define a report 
as strictly active, as opposed to the active definition used in the main 
analysis, the IMS weekly incidence proportions were overall lower 
in both seasons, but the differences were gcnc-ralh' small. In season 

1, estimates of ILI incidence proportions based on active reports 
were on average 0.6 percentage points (median 0.6 range: —0.8 to 
1 .8) higher than those derived from strictly active reports. In season 

2, the differences were generally smaller, on average 0.1 
percentage units (median 0.1 range: —0.2 to 0.4), and of similar 
size across the entire season. 

Exploring time-series comparison based on weighted 
IMS-data, The smoothed weekly weighted ILI incidence 
proportions ranged between 0.9-4.9% in season 1 and 1.3-3.0% 
in season 2. In both seasons, the weighted incidence proportions 
were similar to the crude incidence proportions for most of the 
weeks. However, the weighted data produced peaks in the 
beginning of both seasons at a similar levels of incidence 
proportions as the peak that coincided with the influenza peak 
according to laboratory data. Additionally in season I, the 
weighted data produced a peak towards the end of th(- season. 
Weighted IMS data correlated weaker to all the other surveillance 
data sources than the crude IMS data did (data not shown). 

Difference between weekly estimates. In season 1, the 
weekly estimates of ILI incidence proportions generated by IMS 
were on average 0.9 percentage units (median 1.0) higher than 
those derived from the PBS. By and large, the differences were 
constant across the season, with the greatest (but also the most 
variable) differences in the weeks when inffuenza activity was 
increasing according to laboratory reports. The differences tended 
to subside towards the end of the season. The Bland- Altman plot 
suggested that the two systems did not agree, but rather that IMS 
tended to give higher ILI estimates than PBS (Figure 2). The 
transformation of IMS to PBS (PBS = 0.83-h0.35*IMS) had 95% 
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Figure 1. Epidemic curves 2011-2012 and 2012-2013. The upper graph shows the smoothed weekly III incidence proportions generated by 
IMS and PBS (corrected for estimated demographic misrepresentation [4] and underreporting [5]) and number of laboratory confirmed influenza 
cases, Stockholm 201 1-2012. The lower graph shows the smoothed weekly ILI incidence proportions generated by IMS (based on self-recruited and 
invited participants) and PBS (corrected for estimated demographic misrepresentation [4] and underreporting [5]), number of laboratory confirmed 
influenza cases, and ILI per 1,000,000 listed patients in GP-sentinel reports, Sweden 2012-2013. 
doi:1 0.1 371 /journal.pone.0096740.g001 
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Stockholm County 2011-2012 



IMS = -2.41+2.89PBS (1.69) 
PBS = 0.83+0.35IMS (0.59) 




( IMS + PBS ) / 2 



Sweden 2012-2013 




(IMS + PBS)/2 



Figure 2. Bland-Altman plots 201 1 -201 2 and 201 2-201 3. The upper graph shows a Bland-Altman plot of data from the 201 1 -201 2 season 
and the lower graph shows a Bland-Altman plot of data from the 2012-2013 season. The black dots represents the differences of the weekly 
incidence proportions between the IMS and PBS (y) by the average of the IMS and PBS weekly incidence proportions (x). The thick blue line 
represents a simple linear regression model of the differences on the averages, while the thin blue lines represent the respective 95% limits of 
agreement. The limits of agreement for the difference between the IMS and PBS can be calculated from the equation in the bottom of the graphs, 
when their average is known. With the equations at the top of the graphs one system's incidence proportions can be transformed to the other. 
doi:1 0.1 371 /journal.pone.0096740.g002 



prediction limits of magnitude ±1.18%; i.e. the true PBS 
incidence proportion would fall within ±1.18% of the estimate 
with 95% certainty. 

In contrast, in season 2 the mean and median differences were 
— 0.25 and —0.15 percentage points, respectively, indicating that 
PBS generated higher estimates most of the weeks, but the 
differences were generally smaller. The greatest differences were 
seen in the beginning of the season and when influenza activity 
had passed its peak according to the laboratory reports. A Bland- 



Altman plot suggested that IMS tended to give higher estimates 
when the average of IMS and PBS incidence estimates were below 
1.5%, and that PBS generated higher estimates for higher means 
(Figure 2). The 95% prediction limits after transformation of IMS 
to PBS (PBS= -0.94-H.62*IMS) were of magnitude ±1.36%. 

Discussion 

This evaluation of IMS suggested that self-recruitment led to an 
overrepresentation of women, highly educated and middle-aged 
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persons. The weekly proportion of participants that reported 
decreased gradually throughout both seasons. Furthermore, only a 
small proportion of the invited sample participated in IMS. 
Although the generated ILI-e.stimates differed from PBS estimates, 
especially the first season, IMS data correlated significandy with 
PBS data and with data from the traditional influenza surveillance 
systems. 

Our findings are consistent with previous assessments of 
Influenzanet that suggested that self-recruited layper.son-based 
influenza surveillance systems could detect changes in ILI 
incidence in the population to give signals of the start and 
culmination of influenza epidemics with reasonable accuracy 
[6,7,9-11]. The present study adds by providing a comparison 
with a well-validated population-based surveillance system that 
generated incidence data in real time, corrected for previously 
quantified demographic misrepresentation [4] and under-report- 
ing [5]. 

A recent multi-country analysis of data from seven Influenzanet 
countries (including Sweden), found that participants with fewer 
years of education and of younger ages had a lower compliance to 
complete the weekly report [22], suggesting that further selection 
bias may be introduced as the season proceeds. This is illustrated 
in our analysis as the accumulated number of self-recruited IMS 
participants did not parallel the number of reports and active 
reports submitted each week. Interestingly, the difference between 
the total number of reports and the total number of active reports 
was generally small, and the total number of reports was fairly 
stable across the entire season, particularly in season 2. The 
majority of participants in season 2 responded to more than half of 
their r(;rjuired reports further implying that the overwhelming 
number of reports came from faithful participants who reported 
regularly. Possibly, they reported during limited periods not 
necessarily starting at the beginning of the season. A notable 
exception was the period from February through March during 
season 1, when influenza peaked [12]. At this time, the active-lo- 
total report ratio, in particular, fell noticeably. Many participants 
entered the system, possibly prompted by their own ILI and/or by 
increased attention to IMS produced by the peak itself 

The requirement of at least one report in the three weeks 
preceding each new report does not entirely rule out preferential 
re-entry of sick participants. However, requiring two consecutive 
reports in the preceding two weeks, practically precluding 
preferential re-entry of sick participants, resulted in incidence 
curves with shapes that were trivially different from those based on 
the more relaxed active report definition. This suggests that re-entry 
may only be a minor problem. Nevertheless, weak selection bias in 
IMS during the influenza peak might have amplified and 
improved the signal, thus making the peak more distinct in IMS 
tiian in PBS. 

The reporting activit}' le\cl among IMS participants improved 
in season 2. This may be due to inclusion of moti\'ated participants 
from the previous season and a conei^ntration of marketing efforts 
to the beginning of the season rather than continuously. The 
higher reporting frequency in the invited sample indicates that this 
group was even more motivated to report regularly. Although a 
seven percent response proportion among the invited residents 
likely enriched particularly motivated participants, the detailed 
information in the postal invitation may also have contributed to 
the regular reporting. Furthermore, the step from reex-iving the 
paper invitation to registering online may have demanded more 
motivation to participate than reading about IMS online, where 
registration is only "a mouse click away". 

The underrepresentation of the youngest age groups among 
self-recruited IMS participants may be due to modest emphasis on 



the possibility for legal guardians to act as proxy participants for 
their children. In the oldest age groups, limited Internet 
availability and computer literacy may have prevented participa- 
tion [23]. Interestingly, despite the poor participation rate, the 
randomly selected, invited population sample was more represen- 
tative with regard to age, sex and education. 

Notwithstanding the misrepresentation of self-recruited IMS 
participants, the overall epidemic curves were similar to those 
generated by PBS. This suggests that neither the measured socio- 
demographic factors nor unmeasured determinants of participa- 
tion were strongly associated with the risk of ILI. While IMS may 
detect the start and peak of influenza epidemics, the continuous 
monitoring of absolute incidence rates in various substrata of the 
population may be better accomplished with PBS. Age group- 
specific data is of particular interest because immunity and the 
predisposition to complications differ across ages [24]. Since the 
representation of specific age groups in self-recruited IMS is 
variable and poor in the elderly, the validity of incidence data in 
this group is uncertain. The pattern of misrepresentation of 
participants was similar in PBS [4] , but the deviations were smaller 
compared with self-recruited IMS. Phone reporting offered by 
PBS may explain better representation of older age groups [4]. 
When weighting the IMS sample according to the age and sex 
distribution of the Swedish population, the epidemic curves 
deviated more from the PBS. The weighted estimates may give 
a less biased cross-sectional estimate, but interpretation of time 
series stays as complicated as for crude estimates due to the weekly 
variations in stratum specific reporting activity. However, it is 
reassuring that even misrepresented self-selected populations are 
capable of describing an epidemic of influenza-like illness with a 
reasonable accuracy. 

The diflerences between incidence proportions generated by 
IMS and PBS varied across and between the seasons. Based on the 
Bland-Altman plots and method comparison techniques, the 
prediction limits in the transformation of the estimates of one 
system to the other were of a magnitude that we consider 
unacceptable for the purpose of surveillance. Notably, the 
incidence according to PBS was unprecedentedly high from 
November until the peak in March of season 2 and did not 
coincide with the laboratory confirmed influenza data. The 
reasons for this deviation of PBS data compared to previous 
seasons remain speculative but may relate to the epidemiology of 
respiratory infections. First, the 2012-2013 influenza season was 
unusually long, with three circulating strains that affected all age 
groups [25]. The PBS may have picked up a higher baseline 
activity of ILI that was missed by IMS because of an under 
representation of older and younger age groups. Second, the 
higher baseline activity of ILI in PBS in the first part of season 2 
coincided with the start of the respiratory syncytial virus (RSV) 
season [26], possibly resulting in influenza-like symptoms, mainly 
among children. Lastly, after having been confined to Stockholm 
County for five seasons, a PBS surveillance cohort of the same size 
as the previous cohorts in Stockholm was drawn from the whole 
country (0.12% of th(; Stockholm population vs 0.02% of the 
Swedish population). Local and regional variations in influenza 
surveillance and epidemiology may have affected comparability. 

Limitations 

The elements assessed in this evaluation provided insights about 
the functionality of IMS and illustrated differences between the 
two community-based surveillance systems. However, structured 
evaluation of other aspects, such as timeliness, flexibility, stability 
and resources needed, may provide further understanding about 
the usefulness of IMS [17]. Moreover, evaluation of only two 
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seasons that provided somewhat deviating findings, possibly due to 
differences in the geographical distribution of the sample and 
during some periods the small sample size, makes generalisation of 
the results difficult. 

Conclusion 

In conclusion, the self-recruited IMS participants reflected the 
demography of the Swedish population poorly. Yet IMS offered a 
reasonable representation of the temporal ILI pattern in the 
community overaU during the 2011-2012 and 2012-2013 
influenza seasons and could be a simple tool to collect 
community-based ILI data. However, invited IMS participants 
represented the target population better than the self-recruited and 
completed a larger proportion of reports. Therefore, personal 
invitations to a random sample of the population may improve the 
quality and usability of IMS surveillance data. 
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